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NEMATICIDAL PROTEINS 

The present invention relates to inter alia, nematicidal, nucleic acid sequences 
encoding them and transformed plants containing them. In particular the invention relates to 
5 transformed crops having increased resistance and/or tolerance to nematodes. 

Nematodes are essentially simple worms generally having cylindrical bodies that 
taper towards the head and tail. Nematodes that are of agricultural importance can be 
subdivided into endoparasites, ectoparasites and ecto-endoparasites of plants. Some are 
sedentary and others remain mobile as they feed. All use a stylet to pierce plant cell walls 

10 and feed by removing plant cell contents before or after plant cell modification. More detail 
of particular important genera and species, their host ranges and economic importance are 
defined in well known standard texts. The genera Heterodera and Globodera cyst 
nematodes are important crop pests. They include H. glycines, (soybean cyst nematode) H. 
schachtii (beet cyst nematode), H. avenae (cereal cyst nematode) and potato cyst nematodes 

15 G rostochiensis and G. pallida. Root-knot nematodes particularly the genus Meloidogyne, 
damage a wide range of crops. Examples are species M. javanica, M. hapla, M arenaria 
andM incognita. There are many other economically important nematodes. Both the above 
groups produce swollen sedentary females as do other economic genera including 
Rotylenchulus, Nacobbus, and Tylenchulus. Other economic nematodes remain mobile as 

20 adult females and many of these cause damage to a wide range of crops. Examples include 
species oiDitylenchus, Radopholous, Pratylenchus, Helicotylenchus and Hirschmanniella. 
Others do not always enter plants but feed from them as ectoparasites. Examples include 
Aphelenchoides, Anguina Criconemoides, Criconema Hemicycliophora, 
Hemicriconemoides, Paratylenchus and Belonolaimus. Among the ectoparasites the genera 

25 Xiphinema, Longidorus, Paralongidorus, Trichodorus and Paratrichodrus have distinctive 
importance. They cause damage to crops by their feeding but their economic status as crop 
pests is often due to roles as vectors of inter alia, NEPO and TOBRA plant viruses. 

Plant parasitic nematodes are reported to be responsible for the world-wide loss of 
billions of pounds worth of agricultural crops each year and so the professional agriculturist . 

30 is constantly seeking ways to reduce these losses whilst maintaining a high quality 
commercial crop. 

The present invention therefore seeks to provide, inter alia, proteins that are active 
against nematode. 
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According to the present invention there is provided a nematicidal protein comprising 
or consisting of the sequence of SEQ ID NO: 1. 

The present invention further provides a nematicidal protein having at least 70% 
identity to the protein of SEQ ID NO: 1. In a still further embodiment said protein has at 
least 75% identity to the protein of SEQ ID NO: 1. In a still further embodiment said protein 
has at least 80% identity to the protein of SEQ ID NO: I. In a still further embodiment said 
protein has at least 85% identity to the protein of SEQ ID NO: 1. In a still further 
embodiment said protein has at least 90% identity to the protein of SEQ ED NO: 1 . In a still 
further embodiment said protein has at least 91% identity to the protein of SEQ ID NO: 1. In 
a still further embodiment said protein has at least 92% identity to the protein of SEQ ID 
NO: 1 . In a still further embodiment said protein has at least 93% identity to the protein of 
SEQ ID NO: L In a still further embodiment said protein has at least 94% identity to the 
protein of SEQ ID NO: 1 . In a still further embodiment said protein has at least 95% identity 
to the protein of SEQ ID NO: 1. In a still further embodiment said protein has at least 96% 
identity to the protein of SEQ ID NO: 1. In a still further embodiment said protein has at 
least 97% identity to the protein of SEQ ID NO: 1 . In a still further embodiment said protein 
has at least 98% identity to the protein of SEQ ID NO: 1. In a still further embodiment said 
protein has at least 99% identity to the protein of SEQ ID NO: 1. 

The percentage of sequence identity for proteins is determined by comparing two 
optimally aligned sequences over a comparison window, wherein the portion of the amino 
acid sequence in the comparison window may comprise additions or deletions (i.e. gaps) as 
compared to the initial reference sequence (which does not comprise additions or deletions) 
for optimal alignment of the two sequences. The percentage is calculated by determining the 
number of positions at which the identical amino acid residue occurs in both sequences to 
yield the number of match positions, dividing the number of match positions by the total 
number of positions in the window of comparison and multiplying the result by 100 to yield 
the percentage of sequence identity. When calculating the percentage sequence identity the 
sequences may be aligned allowing for up to 3 gaps with the proviso that in respect of the 
gaps, a total of not more than 15 amino acid residues is affected. Optimal alignment of 
sequences for comparison may also be conducted by computerised implementations of 
known algorithms. In a particular embodiment of the present invention the sequence identity 
is calculated using the FASTA version 3 algorithm which uses the method of Pearson and 
Lipman (Lipman, DJ. and Pearson, W.R. (1985) Rapid and sensitive protein similarity 
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searches and Science. 227:1435-1441 and Pearson, W.R. and Lipman, DJ. (1988) Improved 
tools for biological sequence comparison. PNAS. 85:2444-2448) to search for similarities 
between the reference sequence (also termed the query sequence) and any group of 
sequences (termed further sequences). There are also algorithms available to the person 

5 skilled in the art that enable a calculation of the percentage sequence identity between 
polynucleotide sequences. 

The protein may differ from the basic nematicidal protein of SEQ ID NO: 1 by 
conservative or non-conservative amino acid substitutions. A conservative substitution is to 
be understood to mean that the amino acid is replaced with an amino acid with broadly 

10 similar chemical properties. In particular conservative substitutions may be made between 
amino acids within the following groups: 



(•) 


Alanine and Glycine; 


00 


Serine and Threonine; 


(ii) 


Glutamic acid and Aspartic acid; 


(»i) 


Arginine and Lysine; 


(iv) 


Asparagine and Glutamine; 


(v) 


Isoleucine and Leucine, 


(vi) 


Valine and Methionine; 


(vii) 


Phenylalanine and Tryptophan. 



20 In general, more conservative than non-conservative substitutions will be possible 

without destroying the nematicidal properties of the proteins. Suitable variants may be 
determined by testing nematicidal properties of the peptide using routine methods that are 
well known to the person skilled in the art. 

In one embodiment, the nematicidal protein of the invention is processed to remove 
25 the N-terminal methionine residue. In a further embodiment, the processed protein is 
modified, for example, by acetylation at the N-terminus. 

The present invention still further provides a nematicidal protein as described above 
having a specific lactose binding ability. 

The present invention still further provides a nematicidal protein as described above 
30 having the ability to cleave disaccharides and/or polysaccharides. In a further embodiment, 
the protein is a mannanase. 

In one aspect of the present invention, the nematicidal protein described above is 
obtainable from Lepista nuda. 
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The present invention further provides a polynucleotide which encodes a protein as 
described above. The person skilled in the ait will appreciate that there are a number of 
polynucleotides which can encode each protein due to the degeneracy of the genetic code. In 
a particular embodiment of the invention the polynucleotide comprises or consists of the 
sequence of SEQ ED NO: 2. 

The present invention further provides a polynucleotide which is the complement of 
one which hybridises to a sequence as described above under stringent conditions and 
wherein said polynucleotide still encodes a protein which is nematicidal. Such stringent 
conditions are well known to the person skilled in the art and comprise, for example: 
hybridisation at a temperature of about 65°C in a solution containing 6 x SSC, 0.01% SDS 
and 0.25% skimmed milk powder, followed by rinsing at the same temperature in a solution 
containing 0.2 x SSC and 0.1% SDS followed by confirmation that the polynucleotide so 
identified still encodes a nematicidal protein according to the invention. The skilled man 
may alternatively select the following hybridisation conditions viz,, hybridisation at a 
temperature of between 60°C and 65°C in 0.3 strength citrate buffered saline containing 
0.1% SDS followed by rinsing at the same temperature with 0.3 strength citrate buffered 
saline containing 0.1% SDS followed by confirmation that the polynucleotide so identified, 
still encodes a nematicidal protein according to the present invention. 

Further polynucleotides according to the present invention may be identified from 
nucleic acid libraries. Suitable oligonucleotide probes may be constructed on the basis of the 
amino acid sequence information of the proteins according to the present invention and used 
to screen any such nucleic acid library for the identification of further polynucleotides 
encoding proteins according to the invention. In a particular embodiment of the present 
invention the a polynucleotide selected from the group consisiting of SEQ ID NOs: 2, 30, 32, 
34, 36 and 37 may be used for the construction of oligonucleotide probes by the skilled 
person. The person skilled in the art is well versed in methods for the production and 
screening of nucleic acid libraries and the necessary techniques for the subsequent 
identification, isolation and sequence determination of polynucleotides which encode further 
nematicidal proteins in accordance with the present invention. The person skilled in the art 
will appreciate that alternative methods exist for the identification and characterisation of 
related sequences from inter alia, plant/fungal sources. Such methods include PCR 
strategies based on oligonucleotide primers using the sequence information provided herein 
or from sequences obtainable by the methods described above. 
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In one aspect of the present invention, the polynucleotide described above is 
obtainable from Lepista nuda. 

The present invention still further provides a polynucleotide having a first region 
encoding a protein as described above and a second region encoding a further protein. The 
5 regions may be separated by a region which provides for a self-processing polypeptide which 
is capable of separating the proteins such as the self-processing polypeptide described in US 
Patent No. 5,846,767 or any similarly functioning element. Alternatively the regions may be 
separated by a sequence which acts as a target site for an external element which is capable 
of separating the protein sequences. Alternatively the polynucleotide may provide for a 

10 polyprotein which comprises a plurality of protein functions. In a further embodiment of the 
present invention the proteins of the polyprotein may be arranged in tandem. These 
polyproteins may comprise the proteins according to the present invention and optionally 
further proteins such as those encoding any desired agronomic trait, for example, a further 
agronomic trait selected from the group consisting of herbicide resistance, insect resistance, 

15 fungus resistance, nematode resistance, altered stress tolerance, altered yield and altered 
nutritional content. In a particular embodiment of the present invention the further 
agronomic trait provides resistance to a herbicide which comprises glyphosate acid or 
agriculturally acceptable salt thereof. In a particular embodiment the polynucleotide may be 
used to provide the proteins and proteins compositions of the present invention that are 

20 described herein. 

The present invention still further provides a method of evolving a polynucleotide 
which encodes a protein having nematicidal properties comprising: (a) providing a 
population of variants of said polynucleotide and further polynucleotides which encode 
further proteins, at least one of which is in cell free form; and (b) shuffling said variants and 

25 further polynucleotides to form recombinant polynucleotides; and (c) selecting or screening 
for recombinant polynucleotides which have evolved towards the said nematicidal 
properties; and (d) repeating steps (b) and (c) with the recombinant polynucleotides 
according to step (c) until an evolved polynucleotide which encodes a protein having 
nematicidal properties has been acquired wherein said population of variants in part (a) 

30 contains at least one polynucleotide as described in the preceding paragraphs. The methods 
for evolving a polynucleotide as described above are well known to the person skilled in the 
art and are described inter alia, in US Patent No. 5,81 1 ,238. 
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The present invention still further provides a construct comprising a polynucleotide 
as described above. In a further embodiment, the construct comprises a polynucleotide as 
described above operably linked to a transcription initiation region and a transcription 
termination region. In a particular embodiment, the transcriptional initiation region and/or 
the transcription termination region are operable in plants. In a still further embodiment of 
the present invention the construct may further comprise a region which provides for the 
targeting of the protein product to a particular location. For example if it is desired to 
provide the protein outside of the cell then an extracellular targeting sequence may be ligated 
to the polynucleotide encoding the protein of the present invention. Other examples of 
targeting include targeting to a specific intracellular organelle or compartment. In a still 
further embodiment of the present invention the construct may further comprise a region 
which provides for the retention of the protein at a specific location such as a particular 
organelle or compartment. For example if it is desired to retain a protein in the endoplasmic 
reticulum, the KDEL endoplasmic reticulum retention sequence may be used. The present 
invention still further provides a construct as described above which further comprises a 
region which provides for the production of a selectable marker. In a particular embodiment, 
the selectable marker is provided by the mannose isomerase (Positech™) selection system. 
The selectable marker may, in particular, confer resistance to kanamycin, hygromycin or 
gentamycin. Further suitable selectable markers include genes that confer resistance to 
toxins such as eutypine or that confer resistance to herbicides such as protoporphyrinogen 
oxidase (PPO) inhibitor-based herbicides or glyphosate-based herbicides. In a particular 
embodiment, the gene may be a mutated PPO such as the ones described in PCT Publication 
No. WO 97/3201 1 and PCT Publication No. WO 95/34659 and found, for example, in 
Acuron™ technology. Other forms of selection are also available such as hormone based 
selection systems such as the Multi Auto Transformation (MAT) system of Hiroyrasu 
Ebinuma et al 1997. PNAS Vol. 94 pp21 17-2121; visual selection systems which use the 
known green fluorescence protein, P glucoronidase and any other selection system such as 
xylose isomerase and 2-deoxyglucose (2-DOG). 

The present invention still further provides a construct as described above wherein 
the plant operable promoter is selected from the group consisting of: CaMV35S, FMV35S, 
NOS, OCS, Patatin, E9, alcA/alcR switch, GST switch, RMS switch, oleosin, ribulose 
bisphosphate carboxylase-oxygenase small sub-unit, actin 7, Cestrum or root specific 
promoters including MR7 promoter (maize), Gos 9 (rice), GOS2 promoters, a superMas 
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promoter as described in PCT Publication No. WO 95/14098 and US Patent No. 5,955,646 
(for example (Ocs^Mas), the Agrobacterium rhizogenes RolD promoter, the arabidopsis root 
specific promoters AR1 and AR6 as described in PCT Publication No. WO 01/098480 (see 
SEQ ID NO: 536 and 539 of that application, respectively) and nematode feeding site 
specific promoters or other promoters which are derivable from nematodes or are capable of 
initiating or maintaining expression at a location where the nematode feeds. In particular, 
the Cestrum promoter as described in PCT Publication No. WO 01/73087 is useful for 
directing expression of the nematicidal protein of the present invention. The present 
inventors have shown that this promoter is active in the roots of tobacco and arabidopsis and, 
in particular, that it is not down-regulated, and may in fact be upregulated, in such roots on 
nematode infection. Reference is made to Examples 14 and 15. 

Terminators that can be used in the constructs according to the present invention 
include Nos, proteinase inhibitor II, CaMV35S and the terminator of a gene of alpha-tubulin 
(European Patent Application, Publication No. EP 0 652 286 A). 

It is equally possible to use, in association with the promoter regulation sequence, 
other regulation sequences which are situated between the promoter and the sequence 
encoding the protein according to the present invention, such as transcriptional or 
translational enhancers, for example, tobacco etch virus (TEV) translation activator 
described in PCT Publication No. WO 87/07644, figwort mosaic virus (FMV) enhancer (as 
described in PCT Publication No. WO 00/66748) and cauliflower mosaic virus (CaMV) 35S 
enhancer (Fang et al 1989. The Plant Cell Vol. 1 pp 141 -150). It is also possible to use, in 
association with the promoter regulation sequence, other regulation sequences such as 
introns. The polynucleotide encoding the nematicidal protein according to the invention may 
also be codon-optimised, or otherwise altered to enhance, for example, transcription once it 
is incorporated into plant material. Such codon optimisation may also be used to alter the 
predicted secondary structure of the RNA transcript produced in any transformed cell, or to 
destroy cryptic RNA instability elements present in the unaltered transcript, thereby 
increasing the stability and/or availability of the transcript in the transformed cell (Abler and 
Green. 1996. Plant Molecular Biology (32) pp63-78). Table 1 below shows exemplary 
codon usage preferences for soyabean. 
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Table 1. Codon-usage preferences for Soyabean 



Amino Acid 


Soya Preference 


Alanine 


GCU 


Arginine 


AGA 


Asparagine 


AAC 


Aspartic Acid 


GAU 


Cysteine 


UGC 


Glutamine 


CAA 


Glutamic Acid 


GAG 


Glycine 


GGA 


Histidine 


CAU 


Isoleucine 


AUU 


Leucine 


cuu 


Lysine 


AAG 


Methionine 


ATG 


Phenylalanine 


UUC 


Proline 


CCA 


Serine 


ucu 


Threonine 


ACU 


Tryptophan 


UGG 


Tyrosine 


UAU 


Valine 


GUU 



5 

The present invention still further provides a host cell comprising a polynucleotide or 
a construct as described above. Suitable host cells include micro-organisms or cultured cells. 
Suitable micro-organisms include Escherichia coli and Pseudomonas. Suitable cultured 
cells include cultured insect cells, cultured mammalian cells and plant cells. The proteins' 
10 expressed in the host cells described above may be used in the aspects and embodiments of 
the invention described herein. In addition, these proteins may be used in the production of 
antibodies using techniques well known in the art. Antibodies produced using the proteins of 
the invention are suitable for use in screening assays, for example. 

The present invention still further provides a transgenic plant comprising a 
1 5 polynucleotide or a construct as described above. 

The present invention still further provides a method of providing a plant or plant part 
with a nematicidal protein comprising: (a) inserting into the genome of the plant or of plant 
material a polynucleotide or a construct as described above; and (b) regenerating plants or 
plant parts therefrom; and (c) selecting those plants or plant parts having said protein. The 
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said polynucleotide or construct may be incorporated into the cells by plant transformation 
techniques that are well known to the person skilled in the art. Such techniques include but 
are not limited to particle mediated biolistic transformation, Agrobacterium-ms&dXz& 
transformation, protoplast transformation (optionally in the presence of polyethylene 
glycols); sonication of plant tissues, cells or protoplasts in a medium comprising the 
polynucleotide or vector; micro-insertion of the polynucleotide or vector into totipotent plant 
material (optionally employing the known silicon carbide "whiskers" technique), 
electroporation and the like. 

The present invention still further provides plants or plant parts obtained according to 
the method of the preceding paragraph. In the context of the present invention, plant parts 
include, but are not limited to, protoplasts, cells, seeds, fruits, leaves, flowers and the like 
and any other part of the plant that can be reproduced either sexually, asexually or both. In a 
particular embodiment the plants or plant parts of the present invention are selected from the 
group consisting of: melon, mango, soybean, cotton, tobacco, sugar beet, oilseed rape, 
canola, flax, sunflower, potato, tomato, alfalfa, lettuce, maize, wheat, sorghum, rye, banana, 
barley, oat, turf grass, forage grass, sugar cane, pea, field bean, rice, pine, poplar, apple, 
peach, grape, strawberry, carrot, cabbage, onion, citrus, cereal or nut plants or any other 
horticultural crops. Plants and plant parts in accordance with the present invention show 
improved resistance or enhanced tolerance to a nematode pest when compared to control-like 
or wild-type plants. Resistance may vary from a slight increase in tolerance to the pest to 
total resistance so that the plant is unaffected by the presence of pest (where the pest is 
severely inhibited or killed). 

The invention still further provides plants or plant parts as described above which 
comprise a further agronomic trait selected from the group consisting of herbicide resistance, 
insect resistance, fungus resistance, nematode resistance, altered stress tolerance, altered 
yield, altered nutritional content and any other desired agronomic trait. In a particular 
embodiment of the present invention the further agronomic trait provides resistance to a 
herbicide which comprises glyphosate acid or agriculturally acceptable salt thereof. 

The invention still further provides plants or plant parts as described above which 
comprise a pharmaceutical trait. 

The present invention still further provides the use of a polynucleotide or a construct 
as described above in a method of producing plants which are resistant and/or tolerant to 
nematodes. 
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The present invention still further provides the use of at least one protein as described 
above as an active ingredient in the production of a nematicide. In a still further embodiment 
of the present invention the nematicide further comprises an agriculturally acceptable carrier 
and/or a diluent and/or a nemtatode attractant and may be formulated for use as a spray. 

The present invention still further provides a method of controlling nematodes 
comprising providing at a locus where said nematodes feed at least one protein as described 
above. 

The present invention still further provides a method of controlling nematodes 
comprising providing at a locus where said nematodes feed a mannanase. 

The present invention still further provides a composition comprising a nematicidally 
effective amount of at least one protein as described above and an agriculturally acceptable 
carrier and/or a diluent and/or a nematode attractant. In a particular embodiment, the 
composition comprises a combination of at least two of the proteins described above. 

The present invention still further provides a composition comprising a nematicidally 
effective amount of at least one protein as described above and at least one further protein 
and an agriculturally acceptable carrier and/or a diluent and/or a nematode attractant. In a 
particular embodiment, the further protein may be a herbicidal protein, a fungicidal protein, 
an insectidal protein or another nematicidal protein. The nematicidal proteins according to 
the present invention may also be combined in application with other agrochemicals such as 
herbicides, fungicides, insecticides and nematicides. Examples of possible mixture partners 
include insecticidal and nematicidal lectins, insecticidal protease inhibitors and insecticidal 
proteins derived from species of the Bacillus thurigiensis, Xenorhabdus nematophilus, or 
Photorabdus luminescens and other chemicals for example pyrethroids, carbamates, 
imidacloprid, organochlorines, macromolecules such as spinosad abamectin or emamectin. 

The present invention still further provides a recombinant micro-organism which 
provides for production of a protein as described above. In a further embodiment of the 
invention the recombinant micro-organism is an endophyte or a Pseudomonas sp. An 
endophyte is generally accepted within the art as a micro-organism having the ability to enter 
into non-pathogenic endosymbiotic relationships with a plant host. A method of 
endophyte-enhanced protection of plants has been described in a series of patent applications 
by Crop Genetics International Corporation (for example, PCT Publication No. WO 
90/13224, European Patent Publication No. EP 0 125 468 Bl, PCT Publication No. WO 
91/10363, PCT Publication No. WO 87/03303). PCT Publication No. WO 94/16076 
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describes the use of endophytes which have been genetically modified to express a 
plant-derived insecticidal peptide. 

The present invention also envisages that the proteins of the invention are active 
against nematodes which infect animals or humans. Accordingly, there is provided a 
pharmaceutical composition comprising at least one protein as described above and a 
pharmaceutical^ acceptable carrier. 

In a further aspect of the present invention, there is provided a protein as described 
above for use as a pharmaceutical or a vaccine. 

In a still further aspect the present invention provides for the use of a protein as 
described above in the manufacture of a medicament for vaccination against or treatment of 
animal or human nematode infection. 

The nematodes to be controlled by the proteins of the present invention may include 
but are not limited to: Heterodera sp.; H. schachtii; H. glycines; H. avenae; Meliodogyne 
sp.; M incognita; M.javanica; M. hapla; M. arenaria; Globodera sp. including G. 
rostochiensis; G. pallida; Tylenchulus sp.; Rotylenchulus sp.; Xiphinema sp.; Longidoms sp.; 
Trichodorus sp.\ Paratrichodorus sp.; Scutellonema sp.; Helicotylenchus sp.; 
Hirschmanniella sp.; Pratylenchus sp.; Ditylenchus sp.; Radolpholus sp.; Aphelenchoides; 
Anguina Criconemoides; Criconema Hemicycliophora; Hemicriconemoides and 
Belonolaimus. 

The invention will now be described by way of the following non-limiting examples 
in combination with the following Figures and Sequence Listing of which: 

Figure 1 : Sequence of clone pLnuda-cl8 (SEQ ID NO: 30) aligned with the corresponding 
peptide sequence (SEQ ID NO: 31). The nucleotide is shown as a double stranded 
polynucleotide sequence (SEQ ID NO: 30 and its complement, SEQ ID NO: 38) with the 
corresponding translated peptide sequence (SEQ ID NO: 31) aligned above the coding 
region. The positions of the L. nuda tryptic peptide sequences (SEQ ID NOs: 3, 4, 5 and 6) 
and the 5' UTR are indicated. 

Figure 2: Consensus sequence of six 5' RACE cDNA clones (Lnuda-c25, c26, c27, c30, c31 
and c32; SEQ ID NO: 32) aligned with the corresponding peptide sequence (SEQ ID NO: 
33). The nucleotide is shown as a double stranded polynucleotide sequence (SEQ ID NO: 32 
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and its complement, SEQ ID NO: 39) with the corresponding translated peptide sequence 
(SEQ ID NO: 33) aligned above the coding region. The positions of the translation start 
codon and the 5' UTR are indicated. 

Figure 3: Consensus sequence of seven 3' RACE cDNA clones (Lnuda-c37, c38, c40, c41, 
c42, c47 and c48; SEQ ID NO: 34) aligned with the corresponding peptide sequence (SEQ 
ID NO: 35). The nucleotide is shown as a double stranded polynucleotide sequence (SEQ ID 
NO: 34 and its complement, SEQ ID NO: 40) with the corresponding translated peptide 
sequence aligned above the coding region. The positions of three polymorphisms, the 
translation stop codon, the 3* UTR and the poly A tail are indicated. 

Figure 4: Sequence of full-length cDNA clone Lnuda-c57 (SEQ ID NO: 36) aligned with 
the corresponding protein sequence (SEQ ID NO: 1). The nucleotide is shown as a double 
stranded polynucleotide sequence (SEQ ID NO: 36 and its complement, SEQ ED NO: 41) . 
with the corresponding translated protein sequence aligned above the coding region. The 
positions of three polymorphisms, the translation start codon and the translation stop codon 
are indicated. 

Figure 5: Schematic representation of the positions of the translation start and stop codons 
and the four introns and four exons of the on a full-length gDNA map of the L.nuda gene for 
the nematicidal protein of SEQ ID NO: 1 . In this figure, the translation start and stop codons 
are indicated by vertical black bars, the exons by striped arrows, the introns by a transparent 
bar and flanking regions (the 5* and 3' UTRs) by a horizontal black bar. 

Figure 6: Consensus sequence of twenty full-length gDNA clones (SEQ ED NO: 37) aligned 
with the correseponding protein sequence (SEQ ID NO: 1). The nucleotide is shown as a 
double stranded polynucleotide sequence (SEQ ED NO: 37 and its complement, SEQ ID NO: 
42) with the corresponding translated protein sequence aligned above the coding region. The 
positions of the translation start and stop codons and the introns and exons are indicated. 

SEQ ID NO: 1 is an exemplary nematicidal protein of the invention. 
SEQ ID NO: 2 is a polynucleotide encoding the protein of SEQ ED NO: 1. 
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SEQ ID NO: 3 to SEQ ID NO: 1 1 are protein fragments identified by de novo protein 
sequencing. 

SEQ ID NO: 12 to SEQ ID NO: 29 are primers used in the present invention. 
SEQ ID NO: 30 is the sequence of clone pLnuda-cl8. 
5 SEQ ID NO: 31 is the peptide sequence corresponding to SEQ ID NO: 30. 

SEQ ID NO: 32 is the sequence identified in six 5* RACE cDNA clones (Lnuda-c25, c26, 
c27, c30, c31 andc32). 

SEQ ID NO: 33 is the peptide sequence corresponding to SEQ ID NO: 32. 

SEQ ID NO: 34 is the sequence identified in seven 3' RACE cDNA clones (Lnuda-c37, c38, 
10 c40, c41, c42, c47 and c48). 

SEQ ID NO: 35 is the peptide sequence corresponding to SEQ ID NO: 34. 

SEQ ID NO: 36 is the sequence of full-length cDNA clone Lnuda-c57. 

SEQ ID NO: 37 is the consensus sequence of 20 full-length gDNA clones. 

SEQ ID NO: 38 to SEQ ID NO: 42 are complementary sequences to SEQ ID NO: 30, 32, 34, 
15 36 and 37, respectively. 

SEQ ID NO: 43 is a protein fragment identified by de novo protein sequencing. 

EXAMPLES 

20 General molecular biology methods are carried out according to Sambrook et al 

(1989) 'Molecular cloning: A laboratory Manual, 2nd Edition. Cold Spring Harbour Lab. 
Press. 

Example 1 : Protein sequencing 

25 a) De novo protein sequencing of L. nuda (15kDa band) 

A crude extract from L. nuda was shown to have nematicidal activity using the 
method of Example 6. A 15kDa protein band was purified from this extract using the 
method of Example 5. Tryptic peptides from this protein sample were prepared for 
nanaospray MS/MS sequencing by digestion using trypsin of either solution or 

30 polyacrylamide gel slices containing the protein sample. After digestion with trypsin the 
peptides were desalted using C|g zip tips ™. The peptides were eluted from the zip tips 
using 50:50:0.1 acetonitrile: water: formic acid and directly transferred into a nano 
electrospray needle. Mass spectrometry of the desalted peptides was carried out using a 
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micromass Q-TOF 2 mass spectrometer. The mass spectrometer was calibrated using the 
product ions from the fragmentation of the doubly charged ion of glu-fibrinopeptide B in 
50:50:0.1 acetonitrile: water: formic acid at SOOfmol/jil. Ions from the nanospray needle 
were detected in the Q-TOF mass spectrometer running in survey mode. Multiply charged 
ions were then selected one at a time and fragmented by collision with argon gas to form 
fragment ions. The fragment ion spectra obtained are referred to as MS/MS spectra. The 
most appropriate collision energy for each peptide was selected manually by the operator. 
Under the low energy conditions used the predominant cleavage occurs at the peptide bond 
generating 2 types of ions, the y-ions formed from the C-terminus of the peptide and the b- 
ions generated from the N-terminus of the peptide. By measuring the difference in masses 
between different ions in the MS/MS spectra it is possible to propose amino acid sequences. 
Using the above approach the following peptide sequences were proposed: 

E(Q/K)ELV(Q/K)SG(Q/K)TYLLTNAK (SEQ ED NO: 3) 
LVAVTTPVEWHLWHDEVDHT(366.21) (SEQ ID NO: 4) 
WSSEMYLGLNGSPSDGTK (SEQ ID NO: 5) 
AVTTPVEWHLWHDEVD(623.23) (SEQ ID NO: 6) 

SGNLGLYF or FYLGLNGS (SEQ ID NO: 7 and SEQ ID NO: 8, respectively) 
(243. 19)TVDLS(1 304.65) (SEQ ID NO: 9) 
(1716.75)(Q/K)SAAPGSSHTTGEYTWK (SEQ ID NO: 10) 
(3821.49)NSVYTWK (SEQ ID NO: 1 1) 

Non standard nomenclature used is as follows: 

L means Leucine or Isoleucine 

(Q/K) means Glutamine or a Lysine at this position 

(623.23) means unable to assign remaining 623.23 dalton fragment 

F means Phenylalanine or oxidized Methionine at this position 

b) Edman sequencing of L. nuda ( 15kDa band) 

Whole protein samples that had been blotted onto PVDF were N-terminally 
sequenced by Edman degradation using an ABI Procise cLC Sequencer using programmes 
supplied by the manufacturer. 
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The result indicated that the protein is blocked. By comparison of the MS/MS 
sequencing data with the predicted protein sequence there was a clear indication that the 
protein starts with an acetylated Serine i.e the first Methionine residue is missing. 

In view of the results of the Edman sequencing of the N-terminus of the L nuda 
5 protein, the N-terminal peptide sequence (SEQ ID NO: 3) was revised to read as follows: 

acetylated -S(Q/K)ELV(Q/K)SG(Q/K)TYLLTNAK (SEQ ID NO: 43) 

Example 2: Cloning of cDNA and gDNA 
10 a. Cloning of Partial cDNA 

Nested degenerate oligonucleotides were designed to proposed tryptic peptide 
sequences generated from MS/MS derived data as detailed in Example 1. The following 
tryptic peptide sequences were used: 

15 Peptide 1 (SEQ ID NO: 3) E(Q/K)ELV(Q/K)SG(Q/K)TYLLTNAK m.wt. = 1920.86 

Peptide 2 (SEQ ID NO: 4) LVAVTTPVEWHLWHDEVDHT(366.21 ) m.wt. = 2749.36 

Peptide 3 (SEQ ID NO: 5) WSSEMYLGLNGSPSDGTK m.wt. = 1 927.76 

Peptide 4 (SEQ ID NO: 6) AVTTPVEWHLWHDEVD(623.23) m.wt. = 2537.13 



20 



25 



30 



The following nested oligonucleotides were designed: 

Lnuda-IF GGIMARACITAYHTIHTIAC (SEQ ID NO: 12) 

Lnuda-2F ARGARHTIGTIMARWSIGG (SEQ ID NO: 1 3) 

Lnuda-IR GCRTTIGTIADIADRTAIGT (SEQ ID NO: 14) 

Lnuda-2R TKICCIS WYTKI ACIADYTC (SEQ ID NO: 1 5) 

Lnuda-3F GTIGARTGGCAYHTITGGCA (SEQ ID NO: 16) 

Lnuda-4F ACIACICCIGTIGARTGGCA (SEQ ID NO: 17) 

Lnuda-3R TGRTCIACYTCRTCRTGCCA (SEQ ID NO: 1 8) 

Lnuda-4R TCRTGCCAIADRTGCCAYTC (SEQ ID NO: 19) 

A 418 bp PCR product was amplified from an amplified L. nuda cultivated 
mushroom cDNA library in XZapII (Stratagene #200400/200401/200450) using nested 
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degenerate Lnuda primers in combination with nested T3 promoter specific primers, as 
follows: 

Primary PCR: Lnuda-3R + T3 (AATTAACCCTCACTAAAGGG; SEQ ID NO: 20) 

Secondary PCR: Lnuda-4R + T3-nest (ACTAAAGGGAACAAAAGCTGG; SEQ ID 
NO: 21) 

25 ^1 PCR amplification reactions were run using puRe Taq™Ready-To-Go™PCR 
Beads (Amersham Biosciences #27-9558-01) incorporating 300ng of degenerate primer, 
lOOng of specific primer and ljxl of cDNA library template under the following conditions: 
95°C for 5 minutes, 35 cycles of 95°C for 30 seconds, 50°C for 30 seconds, 72°C for 90 
seconds, then 72°C for 6 minutes (Techne Progene thermal cycler). l|il of primary PCR was 
used as template for secondary PCR. 

The 418 bp PCR amplicon was clorfed directly into pCR® 2.1 TOPO® using TOPO 
TA Cloning Kit (Invitrogen 45-0641) and sequenced by automated cycle sequencing from 
Ml 3 forward (GTAAAACGACGGCCAG; SEQ ID NO: 22) and reverse 
(CAGGASAACAGCTATGAC; SEQ ID NO: 23) primers. 

The sequence of the 41 8 bp Lnuda PCR amplicon (clone pLnuda-cl 8) was 
determined to be: 

TACTAAAGGGAACAAAAGCTGGAGCTCCACCGCGGTGGCGGCCGCTCTAGAACTAGTGGATCCCCCGGGCTGCA 
GGAATTCGGCACGAGGAACTTTCTGCCTCGTTTTTTTGCTCCTACTGTTTTTCTCTTCCAGTTTCTACCATGTC 
GCAAGAAATTGTTCyuVTCAGGACAAACCTACATCATCACTAACGCCAAATCCGGCACAGTTGTTGACCTTTCGG 
GCGAAGACAACAAATCTATTATTGGATTTCCCAAGCATGGAGGAACAAATCAGAGGTGGACCCTCAACTGGACA 
GGGAAGAGTTGGACTTTCCGCTCCGTTTCTTCTGAAATGTATCTTGGCCTGAATGGCTCGCCGTCTGATGGAAC 
AAAACTGGTAGCCGTGACCACCCCTGTTGAGTGGCGCATCTGGCACGA (SEQ ID NO: 30) 

This nucleotide sequence, when translated from the open reading frame starting at the 
ATG codon at nucleotides 144 to 146, corresponds to the following peptide sequence: 

MSQEIVQSGQTYIITNAKSGTVVDLSGEDNKSIIGFPKHGGTNQRWTLNWTGKSWTFRSVSSEMYLGLNGSPSD 
GTKLVAVTTPVEWRIWH (SEQ ED NO: 31) 
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Figure 1 shows this nucleotide sequence (pLnuda-cl8; SEQ ID NO: 30) aligned with 
the corresponding peptide sequence (SEQ ID NO: 31). The nucleotide is shown as a double 
stranded polynucleotide sequence (SEQ ID NO: 30 and its complement, SEQ ID NO: 38) 
with the corresponding translated peptide sequence (SEQ ID NO: 31) aligned above the 
coding region. The positions of the L. nuda tryptic peptide sequences (SEQ ID NOs: 3 4 5 
and 6) and the 5' UTR are indicated. All major tryptic peptides are represented on the 418b P 
cDNA clone thereby confirming authenticity. 

b. Cloning of /. rn.Jn 5' & 3' cDNA onHc 

L. nuda gene specific nested forward and reverse primers were designed in order to 
PCR amplify both 5 ' and 3 ' cDNA ends: 

Lnuda-Sl TCATCACTAACGCCAAATCCG (SEQ ID NO: 24) 
Lnuda-S2 TTGTTGACCTTTCGGGCGAAG (SEQ ID NO: 25) 
Lnuda-S3 TTCAGAAGAAACGGAGCGG (SEQ ID NO: 26) 
Lnuda-S4 TCCAACTCTTCCCTGTCCAG (SEQ ID NO: 27) 

Using*, nuda total RNA, 5' and 3' cDNA ends were PCR amplified using First 
Choice* RLM-RACE Kit (Ambion #1 700) as per manufacturers instructions. 

A 21 1 bp 5' RACE PCR amplicon and a 493 bp 3' RACE PCR amplicon were 
cloned directly into pCR® 2. 1 TOPO® and sequenced. 

L.nuda 5* RACE consensus seq uence- 

Six 5' RACE cDNA clones were identified (Lnuda-c25, c26, c27, c30, c31 and c32) 
having the following consensus sequence: 

GCCTCGTTTTTTTGCTCCTACTGTTTTTCTCTTCCAGTTTCTACCATGTCGCAAGAAATTGTTCAATCAGGACA 
AACCTACATCATCACTAACGCCAAATCCGGCACAGTTGTTGACCTTTCGGGCGAAGACAACAAATCTATTATTG 
GATTTCCCAAGCATGGAGGAACAAATCAGAGGTGGACCCTCAACTGGACAGGGAAGAGTTGGA (SEQ ID 

NO: 32) 

This nucleotide sequence corresponds to the following peptide sequence; the ATG 
translation start codon being found at nucleotides 46 to 48: 
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MSQEIVQSGQTYIITNAKSGTWDLSGEDNKSIIGFPKHGGTNQRWTLNWTGKSW (SEQ ID NO: 33) 

Figure 2 shows the consensus sequence of the six 5' RACE cDNA clones (Lnuda- 
c25, c26, c27, c30, c3 1 and c32; SEQ ID NO: 32) aligned with the corresponding peptide 
sequence (SEQ ID NO: 33). The nucleotide is shown as a double stranded polynucleotide 
sequence (SEQ ID NO: 32 and its complement, SEQ ID NO: 39) with the corresponding 
translated peptide sequence (SEQ ID NO: 33) aligned above the coding region. The 
positions of the translation start codon and the 5' UTR are indicated. 

L.nuda 3' RACE consensus seq uent^- 

Seven 3' RACE cDNA clones were identified (Lnuda-c37, c38, c40, c41, c42, c47 
and c48) having the following consensus sequence. 

TTGTTGACCTTTCGGGCGAAGACAACAAATCTATTATTGGATTTCCCAAGCATGGAGGAACAAATCAGAGGTGG 
ACCCTCAACTGGACAGGGAAGAGTTGGACTTTCCGCTCCGTTTCTTCTGAAATGTATCTTGGCCTGAATGGCTC 
GCCGTCTGATGGAACAAAACTGGTAGCCGTGACCACCCCTGTTGAGTGGCACATCTGGCACGACGAAGTTGACC 
CTTCAACTTATCGTATCTTTGTACCTTTCACCACATTCAACATGGACCTCTACGCCCARGGTAGTGCCGCCCCT 
GGTACGCCTATCACAACTTGGTATACATGGAAGGGYATCCACCAAACGTGGAGGTTTGAACTAGCTTAGGKTCA 
GGTTTCGGATGTAATTTGTGTGTGTAAATCTTCTTGGACCATGTTGTGCTTTTATTGTACTCCGCTTGTTATCA 
TTATACCCACCTATGTTGCAACATCTTTTTGGATCCCAAAAAAAAAAAA (SEQ ID NO: 34) 

This nucleotide sequence corresponds to the following peptide sequence, translation 
being carried out starting from the GTT codon at positions 3 to 5 and finishing at the TAG 
codon at positions 362 to 364: 



VDLSGEDNKS I IGFPfCHGGTNQRWTLNWTGKS WTFRS VSS EM YLGLNGS PSDGTKLVAVTTPVEWHIWHDEVDP 
STYRIFVPFTTFNMDLYAQGSAAPGTPITTWYTWKGIHQTWRFELA (SEQ ID NO: 35) 

Two polymorphisms were detected within the coding region (A/G and C/T; shown 
as R and Y in SEQ ID NO: 34) and one outside of the coding region (T/G; shown as K in 
SEQ ID NO: 34). The two polymorphisms within the coding region are in the 3 rd position 
and therefore the amino acid sequence is not affected. 

Figure 3 shows the consensus sequence of the seven 3' RACE cDNA clones 
(Lnuda-c37, c38, c40, c41, c42, c47 and c48; SEQ ID NO: 34) aligned with the 



WO 2005/012340 



-19- 



PCT/GB2004/003231 



corresponding peptide sequence (SEQ ID NO: 35). The nucleotide is shown as a double 
stranded polynucleotide sequence (SEQ ID NO: 34 and its complement, SEQ ID NO: 40) 
with the corresponding translated peptide sequence aligned above the coding region. The 
positions of three polymorphisms, the translation stop codon, the 3' UTR and the poly A tail 
are indicated. 



c. Cloning o f full-length I. nuda cDNA 

L. nuda gene specific primers were designed to conserved 5* and 3' UTR regions in 
order to PCR amplify full-length clones using cDNA template as generated by the RLM- 



) RACE Kit: 



Lnuda-S5 TCTCTTCCAGTTTCTACCATG (SEQ ID NO: 28) 
Lnuda-S6 ACAAATTACATCCGAAACCTG (SEQ ID NO: 29) 

25 ul PCR amplification reactions were run using puRe Taq™Ready-To-Go™PCR 
Beads (Amersham Biosciences #27-9558-01) incorporating lOOng of each specific primer 
and lul of cDNA template under the following conditions: 95°C for 5 minutes, 35 cycles of 
95°C for 30 seconds, 62°C for 30 seconds, 72°C for 90 seconds, then 72°C for 6 minutes 
(Techne Progene thermal cycler). 

A 491 bp PCR amplicon was cloned directly into pCR®2.1 TOPO® using TOPO TA 
Cloning Kit (Invitrogen 45-0641) and sequenced by automated cycle sequencing from M13 
forward (GTAAAACGACGGCCAG; SEQ ID NO: 22) and reverse 
(CAGGASAACAGCTATGAC; SEQ ID NO: 23) primers. 

A total of 17 full-length L. nuda cDNA clones were sequenced. The sequence of 
clone Lnuda-c57, which is representative of the consensus sequence, is given below: 

TCTCTTCCAGTTTCTACCATGTCGCAAQAAATTGTTCAATCAGGACAAACCTACATCATCACTAACGCCAAATC 

CGGCACAGTTGTTGACCTTTCGGGCGAAGACAACAAATCTATTATTGGATTTCCCAAGCATGGAGGAACAAATC 

AGAGGTGGACCCTCAACTGGACAGGGAAGAGTTGGACTTTCCGCTCCGTTTCTTCTGAAATGTATCTTGGCCTG 

AATGGCTCGCCGTCTGATGGAACAAAACTGGTAGCCGTGACCACCCCTGTTGAGTGGCACATCTGGCACGACGA 

AGTTGACCCTTCAACTTATCGTATCTTTGTACCTTTCACCACATTCAACATGGACCTCTACGCCCAAGGTAGTG 

CCGCCCCTGGTACGCCTATCACAACTTGGTATA<^TGGAAGGGCATCCACCAAACGTGGAGGTTTGAACTAGCT 
TAGGGTCAGGTTTCGGATGTAATTTGT (SEQ ID NO: 36) 
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The open reading frame within this region coding for a protein product is given 

below: 

ATGTCGCAAGAAATTGTTCAATC^ 

TTCGGGCGAAGACAACAAATCTATTATTGGATTTCCCAAGCATGGAGGAACAAATCAGAGGTGGACCCTCAACT 

GGACAGGGAAGAGTTGGACTTTCCGCTCCGTTTCTTCTGAAATGTATCTTGGCCTGAATGGCTCGCCGTCTGAT 

GGAACAAAACTGGTAGCCGTGACCACCCCTGTTGAGTGGCACATCTGGCACGACGAAGTTGACCCTTCAACTTA 
TCGTATCTTTGTACCTTTGACCACATTCAACATC 

TCACAACTTGGTATACATGGAAGGGCATCCACCAAACGTGGAGGTTTGAACTAGCTTAG (SEQ ID NO: 2) 

The L. nuda gene is comprised of a 429 bp open reading frame (SEQ ID NO: 2), 
encoding 142 amino acids (SEQ ID NO: 1), thereby giving a predicted molecular weight of 
15962.85 Daltons. There appears to be only one gene encoding for the L nuda nematicidal 
protein of the present invention. 

The confirmed protein sequence coded for by SEQ ID NO: 2, and nucleotides 19 
to 447 of SEQ ID NO: 35, is as follows: 

MSQEIVQSGQTYIITNAKSGTWDLSGEDNKSIIGFPKHGGTNQRWTI^TGKSWTFRSVSSEMYLGLNGSPSD 
GTKLVAVTTPVEWHIWHDEVDPSTYRIFVPFTTFNMDLYAQGSAAPGTPITTWYTWKGIHQTWRFELA (SEQ 

ID NO: 1) 

This protein is a non-secretory protein as predicted by SignalP. When BlastP 
searched, there was only one significant match: 55% identity to an Agaricus bisporus 
putative mannanase. Over the full protein length, there is 52.1% identity to the A.bisporus 
putative mannanase. 

Figure 4 shows the sequence of the full-length cDNA clone Lnuda-c57 (SEQ ID NO: 
36) aligned with the corresponding protein sequence (SEQ ID NO: 1). The nucleotide is 
shown as a double stranded polynucleotide sequence (SEQ ID NO: 36 and its complement, 
SEQ ID NO: 41) with the corresponding translated protein sequence aligned above the 
coding region. The positions of three polymorphisms, the translation start codon and the 
translation stop codon are indicated. 
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d. Cloning of fiill-len gth L.nuda p T)NA 

Primers Lnuda-S5 and Lnuda-S6 were used to PCR amplify the full-length coding 
region from Lnuda cultivated mushroom genomic DNA (gDNA). Amplification conditions 
were as described above. 

A resulting 706 bp amplicon was TOPO TA cloned and sequenced, as previously 
described. The consensus sequence for 20 positive clones is shown below: 

TCTCTTCCAGTTTCTACCATGTCGCAAGAAATTGTTCAATCAGGACAAACCTACATCATCACTAACGCCAAATC 

CGGCACAGTTGTTGACCTTTCGGGCGAAGACAACAAATCTAGTAAGTCGTTTTTAGTCCCATGTTTTTTTTTGT 

CAAAAAAAATTGACTGACATATTTTGTCTCCAGTTATTGGATTTCCCAAGCATGGAGGAACAAATCAGAGGGTA 

GGTCTAGAAATGCACCTCGTTAATATTGGTTTTTATTGACATTCATGAACAGTGGACCCTCAACTGGACAGGGA 

AGAGTTGGACTTTCCGCTCCGTTTCIT'CTGAAATGTATCTTGGCCTGAATGGCTCGCCGTCTGATGGAACAAAA 

CTGGTAGCCGTGACCACCCCTGTTGAGTGGCACATCTGGCACGACGAAGTTGACCCTTCAACTTATCGGTGAGT 

CCCCTAAATATTACTTGCTTGTGGTTCATACTAATACGTCGTTCGAAGTATCTTTGTACCTTTCACCACATTCA 

ACATGGACCTCTACGCCCAGGGTAGTGCCGCCCCTGGTACGCCTATCACAACTTGGTATACATGGAAGGGTATC 

CACCAAACGTGGAGGTTTGAACTAGGTAGGGCTTGCGATCTCACCCGGATCCTCCATGAACTAATGTGATCACG 

TCGTGTTCTAGCTTAGGTTCAGGTTTCGGATGTAATTTGT (SEQ ID NO: 37) 

No polymorphisms were detected. The amino acid sequence is 100% identical to that 
translated from the cDNA, as predicted. Introns were mapped according to the GT:AG rule. 
Four intronic regions could be mapped to the gDNA sequence: 

Intron 1 66 bp 

Intron 2 55 bp 

Intron 3 54 bp 

Intron 4 60 bp 



Intron 4 splices within the very last codon (A) immediately upstream of the TAG stop 
codon. Figure 5 shows a schematic representation of the positions of the translation start and 
stop codons and the four introns and four exons on a full-length gDNA map of the Lnuda 
gene encoding for the nematicidal protein of SEQ ID NO: 1. Figure 6 shows the consensus 
sequence of the twenty full-length gDNA clones (SEQ ID NO: 37) aligned with the 
correseponding protein sequence (SEQ ID NO: 1). The nucleotide is shown as a double 
stranded polynucleotide sequence (SEQ ID NO: 37 and its complement, SEQ ID NO: 42) 
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with the corresponding translated protein sequence aligned above the coding region. The 
positions of the translation start and stop codons and the introns and exons are indicated. 

Example 3: Construction of vectors. 

The person skilled in the art is capable of producing the polynucleotides and 
polynucleotide sequences according to the invention utilising standard molecular biology 
cloning techniques. In addition, various polynucleotides based on the sequence information 
as listed in the Sequence Listing and described above can be synthesised chemically using 
standard techniques well known to the person skilled in the art. 

Basic vectors are available or may be constructed which are suitable for assembling 
the polynucleotides and polynucleotide sequences according to the invention. The 
component parts of the polynucleotides may be synthesised or may be cloned from other 
vectors or libraries containing said sequences. The person skilled in the art may generate 
probes based on the information presented in the Sequence Listing to isolate the sequences 
according to the invention also using standard techniques. The vectors may be used for 
DNA work (sequencing, mutagenesis), for in vitro production and for plant transformation, 
following methods well known in the art. When providing vectors for plant transformation 
which utilise Agrobacterium, it is preferred that the sequences according to the invention are 
inserted between the border regions of a single T-DNA region. Agrobacterium may be 
transformed in accordance with methods which are well known to the person skilled in the 
art. 

Example 4: Transformation 

The vectors are transformed into plant cells using standard procedures. Any 
transformation method suitable for the target plant or plant cells may be employed, 
including, but not limited to, Agrobacterium-medizted transformation (for example, infection 
by Agrobacterium tumefaciens containing recombinant Ti plasmids, infection with 
Agrobacterium rhizogenes transformed with binary vectors to produce transgenic hairy root 
cultures (Cho H-J et al. Planta 210, 195-204)), particle mediated biolistic transformation, 
protoplast transformation (optionally in the presence of polyethylene glycols), 
electroporation, sonication of plant tissues, cells or protoplasts in a medium comprising the 
polynucleotide or vector, microinjection of plant cells and protoplasts, bacterial 
bombardment, micro-insertion of the polynucleotide or vector into totipotent plant material 
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(optionally employing the known silicon carbide "fibre" or ''whisker" method) and pollen 
tube transformation. The transformed cells may then in suitable cases be regenerated into 
whole plants in which the new nuclear material is stably incorporated into the genome. Both 
transformed monocot and dicot plants may be obtained in this way. Full details of the 
5 methods of transformation are known to the person skilled in the art. 

Example 5: Protein purification from L. nuda 

Approximately 100 gram quantities of L nuda were frozen in liquid nitrogen and 
ground using a pestle and mortar. Ground mushroom was placed into a flat-bottomed dish 
10 with plastic, perforated lid and was then freeze dried for two days until all moisture had been 
removed from the sample. 

The lyophilised mushroom was removed from the container and weighed. 20mls of 
extraction buffer was added for every lg of lyophilised mushroom. Protease inhibitor 
cocktail containing a mixture of water-soluble protease inhibitors with broad specificity for 
15 the inhibition of serine, cysteine, aspartic, and metallo-proteases (containing 4-(2- 

aminoethyl) benzenesulfonyl fluoride (AEBSF), E-64, bestatin, leupeptin, aprotinin, and 
sodium EDTA) was added (1 pot of protease inhibitor cocktail was made up to 100ml with 
the addition of ultra pure water). 1ml of protease inhibitor cocktail was used for every 100ml 
of mushroom extract. The extract was incubated overnight at 4°C with stirring to ensure the 
20 full extraction of all proteins. 

The particulates were then removed by centrifugation. Extract was centrifuged at as 
high a speed as the rotor allowed, and the supernatant removed. The supernatant was then 
filtered to ensure all particulates were removed. 

In order to concentrate and remove other contaminants, ammonium sulphate was 
25 added to the extract to 90% saturation. This was incubated at 4°C with rolling for 1 hour. The 
protein was then spun down using ss34 rotor at 20,000 rpm for 20mins. The supernatant was 
removed and the precipitated protein was re-solubilised in 20mM Tris (pH 8.0) in order to 
give to solution some buffering capacity. 

The resuspended extract was dialysed into Lactosyl Agarose Buffer A (PBS pH 7.5) 
30 using 3kDa dialysis membrane. This ensured removal of the remaining ammonium sulphate 
and allowed further purification using lactosyl agarose. This purification was carried out as 
follows. 2mls of 50% lactosyl-agarose slurry (lactosyl agarose from Sigma in 0.15M NaCl 
and 0.01 M sodium phosphate, pH 6.8) was pipetted into 15ml Falcon tube. The beads were 
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washed 4 times with lOmls of PBS. After each wash, the beads were pelleted by spinning at 
3500 rpm for 4mins (speed is important as higher speeds may disrupt the integrity of the 
beads) and as much supernatant as possible was removed without disturbing the beads. 
lOmls of mushroom extract was added to the tube which was then incubated overnight to 
allow lectin binding. The beads were then pelleted again and the supernatant is removed as 
Unbound fraction. The beads were then washed and pelleted again three times with lOmls of 
PBS solution to remove the remaining unbound proteins. lOmls of 0.4M a-lactose was then 
added and the tube was incubated at 4°C with rolling for 1 hour. The beads were pelleted 
and the supernatant removed as Bound fraction. 10(il of original extract, unbound and bound 
was run on Novex 4-12% Bis/tris SDSpage with MES buffer and Mark 12 molecular weight 
markers. 

The lactose bound fraction was concentrated 5 times and loaded onto a superdex 70 
(Amersham Biosciences) column pre-equilibrated with PBS. An isocratic gradient was run 
and 1 mL fractions collected and assayed for activity. This last step purified the nematicidal . . 
protein to apparent homogeneity. It is noted that the N-terminal methionine of SEQ ID NO: 
1 is cleaved in the pure protein extracted from L. nuda. In addition, the new N-terminal 
residue (a serine) is acetylated. 

Example 6: Recombinant protein production 

The cDNA sequence (SEQ ID NO: 2), operably linked to appropriate regulatory 
regions, was cloned into plasmid pET24a (Novagen) to allow expression of protein with 
sequence identical to SEQ ID NO: 1 and the construct was transformed into E . coli 
BL21(DE3) (Invitrogen) competent cells using standard methods. It is noted that the protein 
as expressed in E. coli lacks the N-terminal methionine residue, which has been cleaved to 
give the mature protein. The pET24a construct was also subjected to mutagenesis to allow 
expression of SEQ ID NO: 1 with an additional glycine residue encoded after the start 
methionine, to potentially mimic amino-terminal acetylation of the mature protein (referred 
to hereinafter as L. nuda G). Recombinant protein expression was induced at 20°C by 
addition of IPTG and the resultant protein was purified from soluble extract by binding to 
lactosyl-agarose and elution with a-lactose as described in Example 5. The purified protein 
was concentrated and dialysed into lOmM Tris-HCl, pH 7.5 for assay of nematocidal effect 
on H, schachtii as detailed in Example 7. 
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Example 7: Nematicidal effect on Heterodera schachtii in plants 

A micro-injection screen was developed to assess nematicidal effects of a range of 
test samples (including crude extracts containing proteins and polypeptides, partially purified 
proteins or polypeptides, purified proteins or polypeptides and recombinantly expressed 
5 proteins or polypeptides). These test samples are delivered directly to feeding sites induced 
by Heterodera schachtii juveniles feeding on Arabidopsis thaliana roots. 

Sterile Arabidopsis thaliana seedlings (2 weeks old) growing in a thin layer of Knop 
media are inoculated with 100-200 sterile hatched J2s of Heterodera schachtii and 
maintained at 24°C, 16 h light, 8h dark. Between 6-9 d post inoculation individual feeding 
10 sites of J3 stage nematodes are injected with approximately 0.2 nl of test sample 

(concentration of approximately 1000 ppm/1 mg/ml) and lucifer yellow dye (10 mM) which 
enables visualisation of oral uptake by the feeding nematode using an inverted microscope, 
Nikon Eclipse TE200 with epi-fluorescence microscope attachment. Between 7-20 
successful uptakes are carried out for each sample using several individual seedlings. 
15 Seedlings with injected feeding sites, from which nematodes have ingested sample, are 
sandwiched between two glass slides and kept at 24°C, 16 h light, 8h dark for at least 7 d 
during which time the development of the nematodes are assessed. 

The efficacy of the test sample is recorded as the number of male/female/juvenile 
stage nematodes dead or live after approximately 7-14 d. For all tests effect codes may be 
20 attached to the score, including: 

Nematode median bulb pumping 
Nematode moulting/development 
Evidence of vacuolation inside nematode 

25 

Using this microinjection assay, a crude extract from L. nuda (as referred to in 
Example 1), protein purified from Z. nuda using the method of Example 5 (sample referred 
to in Example 5, after loading on superdex 70) and recombinant L. nuda protein (prepared 
using the procedure of Example 6) was tested for nematicidal activity. The results of these 
30 assays are shown in the table below: 
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Table 2. Nematidal activity of native and recombinant L. nuda protein 



% Assessment < 
Sample Concentration Uptakes Dead mortality toxicity 



Controls 












Lucifer yellow (negative 
control) 


10 mM 


32 


5 


16 




Arsenic acid (positive 
control) 


2% 


24 


17 


71 


+ 


Purified 












L. nuda crude extract 


0.9 mg/ml 


8 


7 


88 


+ 


L. nuda lactosyl-agarose 
bound and purified 


0.9 mg/ml 


8 


6 


75 


+ 


Recombinant 












Recombinant L. nuda G 


1 mg/ml 


8 


8 


100 


+ 


Recombinant L. nuda 


0.25 mg/ml 


7 


5 


71 


+ 



As can be seen, the crude extract, purified protein and both forms of recombinant 
5 protein showed activity equivalent or better than that shown the positive control (2% arsenic 
acid). 



Example 8: Transformation of soybean (Glycine max) 

Soybean cotyledon explants are harvested and wounded with a scalpel dipped in a 

10 culture of A. rhizogenes containing binary vectors as described in Example 3 and grown 
overnight in LB medium containing 150mg/l kanamycin. After wounding, the explants are 
dried on sterile filter paper and transferred onto co-cultivation media for 3 days. The 
explants are then transferred onto hormone-free media supplemented with 500 mg/1 
carbenicillin and 200 mg/1 kanamycin. About 10-14 days after root emergence, l-2cm long 

15 root tips are transferred onto fresh new media and regularly sub-cultured every 5-6 weeks. 
Transgenic hairy roots may be used in feeding assays with nematodes. 

Example 9: Transformation of tobacco (Nicotiana tobacum ssp.) . 

Tobacco was transformed by co-cultivation of plant tissue with an Agrobacterium 
20 tumefaciens strain containing binary vectors as described in Example 3. The person skilled 
in the art will appreciate that alternative binary vectors may be used in association with the 
present invention. Transformation was carried out using co-cultivation of tobacco leaf 
segments from 4 week old plants. Transgenic plants were regenerated from shoots that grew 



WO 2005/012340 



-27- 



PCT/GB2004/003231 



on selection medium (containing kanamycin), rooted and transferred to germination medium 
or soil. Young plants were then grown to maturity and allowed to self-pollinate and set seed 
or used in feeding assays with nematodes. 

Example 10: Transformation of other plants 

Other plants, including, but not limited to, Arabidopsis, sugarbeet, potato and tomato, 
are transformed using methods well known to those skilled in the art. 

Example 1 1 : Analysis of transgenic hairy roots for susceptibility to plant parasitic nematodes 
(PPN) 

Eggs are obtained from females of H. glycines collected from soybean plants grown 
in a greenhouse. Eggs are washed by centrifugation in sterile distilled water, surface 
sterilized and washed again. About 200 eggs are added to recently sub-cultured transformed 
root cultures (about 200mg fresh weight). Cultures are incubated at 25°C and about 4-5 
weeks after inoculation, scored for presence of female cysts. 

Example 12: Analysis of transgenic tobacco plants for susceptibility to PPN 

Transgenic tobacco plants prepared using the method of Example 9 were assayed in 
soil for resistance against M. incognita. Transgenic plants from tissue culture were 
transferred into 3 inch pots and acclimatised in the greenhouse for 7 days. Two holes were 
made in the soil on either side of the stem, approximately 1 cm from the stem and 2 cm deep. 
1500 infective second stage juvenile (J2) Af. incognita in liquid were inoculated into the 
holes. Root knots were counted approximately 3 weeks post-inoculation. 

Example 13: In planta expression. 

Events were tested for protein expression by western blot analysis. Protein was 
extracted from root samples of soybean and tobacco hairy roots transformed as in Examples 
8 and 9 and lOjxg protein subjected to SDS-polyacrylamide gel electrophoresis alongside 
25ng recombinant protein (as prepared in Example 6), and western blot analysis performed. 
The filter was incubated with antibody raised in rabbit against recombinant protein, washed, 
then incubated with goat anti rabbit-HRP (Sigma) secondary antibody and binding detected 
by enhanced chemiluminescence. 
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Soybean hairy root events 19B1, 21E3, 26B2, 26D3, 27C1, 28E1 and 83C2 and 
tobacco events 9824, 9825, 9826, 9827, 9828 and 9829 showed expression of L.nuda 
recombinant protein as detected by this method. 

5 Example 14: TobzccolMeloidogyne incoznita promoter activity assays 

Feeding assays to determine promoter activity upon nematode infection were carried 
out on transgenic tobacco transformed via Agrobacterium tumefaciens with a construct 
containing cestrum yellow leaf curling virus promoter driving GUS reporter gene. 
Transgenic plants from tissue culture were transferred into 3 inch pots and acclimatised in 

10 the greenhouse for 7 days. Two holes were made in the soil on either side of the stem, 

approximately 1 cm from the stem and 2 cm deep. 1500 infective second stage juvenile (J2) 
M. incognita in liquid were inoculated into the holes. One week after nematode inoculation, 
roots were washed and transferred into GUS stain solution (0.1M sodium phosphate (pH7), 
ImM potassium ferrocyanide, ImM potassium ferrocyanide, ImM EDTA, 0.5% Triton X- 

15 100, ImM X-gluc), subjected to vacuum for 10 min, then placed at 37°C for 2 hours. The 
stain solution was rinsed away and replaced with 70% ethanol. Knots were visualised for 
GUS stain on a Nikon SMZ 645 microscope and scored as GUS positive or negative. Table 
.3 below shows results of knots for 4 independent events (2 clonal plants per event). 

20 Table 3. Cestrum promoter activity in tobacco roots 



Event 


# Assessed 


# GUS positive 


9619 


47 


40 


9623 


11 


11 


9625 


26 


25 


9629 


31 


29 



As can be seen, the vast majority of the assessed knots in all four events remained 
GUS positive after infection with M. incognita. 

Example 15: Arabidopsis/ Heterodera schachtii promoter activity assays 

Feeding assays to determine promoter activity upon nematode infection were carried out 
on transgenic Arabidopsis thaliana (ecotype WS) transformed via Agrobacterium tumefaciens 
with a construct containing cestrum yellow leaf curling virus promoter driving GUS reporter 



\ 
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gene. T3 seeds were sown onto agar. These seeds had been sterilised by washing in 70% EtOI 
(2 mins), soaking in 4% Na-hypochlorite (15 mins), and washing in sterile water (3 rinses then 
extended washes of 3 mins each). After sterilisation, the seeds were dried overnight in a sterile 
cabinet before being sown onto 7ml knop agar in 9cm petri dishes (6 seeds per petri, 4 petris pe 
5 transgenic line). Petris were sealed with parafilm then stored vertically in a Sanyo growth 
cabinet (25°C, 16 hr photoperiod). At 14 days after sowing, the root systems of the seedlings 
were infected with 50 sterile J2 H. schachtii per seedling. For each transgenic line seedlings on 
one petri dish were left uninfected. GUS staining was carried out in the following way at 10 
days post infection: Shoots were removed from plants grown on agar plates prior to staining 

10 whole roots directly using 5ml GUS stain solution (0.1M sodium phosphate (pH7), ImM 
potassium ferrocyanide, ImM potassium ferrocyanide, lOmM EDTA, 0.1% Triton, 0.1% X- 
GlcA) with incubation overnight at 37°C. The stain solution was rinsed away and replaced witf 
70% ethanol. Stained roots were stored at 4°C prior to examination or examined immediately. 
The presence of GUS within the syncitium 10 days after plants were infected with nematodes 

15 was scored as follows: GUS + (no change in staining compared to surrounding root); GUS +/- 
(decrease in staining, or patchy staining, compared to surrounding root); GUS - (no GUS 
staining present in feeding site). Roots were viewed using a Leica M420 whilst roots were on 
the agar plates to prevent detachment of the nematode from the root. The table below shows 
results for 2 independent transgenic events. 

20 

Table 4. Cestrum promoter activity in Arabidopsis roots. 







-/+ 


+ 


total number feeding sites 
assessed 


Cestrum. AtpBSCl 1300. JH 
Dline20L 


2.3% 


24.8% 


72.9% 


133 


Cestrum. At.pBSC 11 300. 
28.10 


6.3% 


19.8% 


73.9% 


111 



As can be seen, a majority of feeding sites showed no change in staining compared to 
25 surrounding roots. 

Other modifications of the present invention will be apparent to those skilled in the 
art without departing from the scope of the invention which is defined by the appended 
claims. All publications cited herein are hereby incorporated by reference in their entirety 
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for all purposes to the same extent as if each individual publication were specifically and 
individually indicated to be so incorporated by reference. 



